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ABSTRACT 

We analyse the correlation function of mock galaxy clusters in redshift space. We con- 
structed several mock catalogues designed to mimic the selection biases inherent in 
a variety of observational surveys. We explore different effects that contribute to the 
distortion of the clustering pattern; the pairwise velocity distribution of galaxy sys- 
tems, coherent bulk motions, redshift errors and systematics in cluster identification. 
Our tests show that the redshift-space clustering pattern of galaxy systems is highly 
influenced by effects associated with the identification procedure from two dimensional 
surveys. These systems show a spuriously large correlation amplitude, an effect that 
is present and even stronger in a subsample whose angular positions coincide with 
3-dimensional identified clusters. The effect of a small number of redshift measure- 
ments is also that of increasing the correlation amplitude. In a similar fashion, the 
bias parameter inferred from cluster samples is subject to these observational prob- 
lems which induce variations of up to a factor of two in such determinations. Also, 
we find that the estimated mean pairwise velocity dispersion can be up to an order 
of magnitude larger than the actual value. Errors in the estimated cluster redshift, 
originating from the use of too few redshift measurements per cluster, have a smaller 
impact on the measured correlation function. We show that an angular incompleteness 
in redshift surveys, such as that present in the 2dFGRS 100k public release, has no 
significant effect in the results. We suggest that the nature of projection effects arise 
mainly from structures along the line of sight in the filamentary large-scale clustering 
pattern. Thus, spectroscopic surveys are the only means of providing unbiased cluster 
samples. 

Key words: methods: statistical - methods: numerical - large-scale structure of 
Universe - galaxies: clusters: general 



1 INTRODUCTION 

Rich clusters of galaxies form at the highest peaks of the 
mass distribution through physical processes which are sim- 
pler than those involved in galaxy formation. As a conse- 
quence, models show that the correlation function and power 
spectrum of clusters of galaxies have the same shape as that 
of the mass, with an amplitude depending on a small number 
of parameters (see for example Colberg et al. 2000, Sheth, 
Mo & Tormen 2001, Padilla & Baugh, 2002). In fact, the 
space density of clusters of galaxies is usually the only pa- 
rameter needed to obtain a direct estimate of the underlying 
mass distribution (Padilla & Baugh, 2002). This shows the 
clear advantage of studying clusters of galaxies, which has 



made them popular tracers of the large scale structure of 
the Universe in the literature. 

Cluster distances are derived from member galaxy red- 
shifts, which are affected by galaxy peculiar velocities. Even 
if all the member galaxy redshifts are used in the deter- 
mination of the cluster distance, this will be still affected 
by the cluster peculiar motion, and will not be a true dis- 
tance measurement. The distribution function of cluster pe- 
culiar motions depends upon the value of the mass density 
parameter Q, for models with similar mass fluctuation am- 
plitudes (Croft & Efstathiou 1994a; Bahcall, Cen & Gra- 
mann 1994). These motions, either for galaxies or clusters 
of galaxies, produce an apparent distortion of the clustering 
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pattern as measured by the two-point correlation function 
in redshift space, £(<?, 7r), where a and tv are the separations 
perpendicular and parallel to the line of sight, respectively. 
At small separations, non-linear, nearly virialized regions 
produce elongations along the line of sight which allow mea- 
surements of the one-dimensional pairwise rms velocity dis- 
persion, < w? 2 > 1//2 (Davis & Peebles 1983). Larger scales 
are dominated by the infall onto overdense regions in the 
form of bulk motions which result in a compression of the £ 
contours along the direction of the line of sight. 

Evidence of strong anisotropies in the correlation func- 
tion of Abell clusters has been given by several authors (see 
for instance Postman, Huchra & Geller 1992 and references 
therein). Systematic effects in the Abell catalogue originate 
in the superposition of clusters along the line of sight and 
generate a bias in the observed correlation function of the 
catalogue (Sutherland 1988; Sutherland & Efstathiou 1991; 
see also Lucey 1983). Using mock catalogues from numer- 
ical simulations, van Haarlem et al. (1997) showed that a 
large fraction of Abell clusters of richness class R > 1 would 
not be physically bound systems and would suffer from con- 
tamination by galaxies and groups along the line of sight. 
Therefore it is likely that the observed large anisotropies 
could be mainly produced by systematics in cluster detec- 
tion algorithms from angular galaxy catalogues. 

The construction of objectively defined cluster cata- 
logues drawn from machine-scanned survey plates with bet- 
ter calibrated photometry brought a new generation of clus- 
ter catalogues which are believed to be less affected by iden- 
tification biases (APM: Dalton et al. 1992, 1994, 1997; Cos- 
mos: Lumsden et al. 1992). The typical radius used to de- 
fine clusters in the machine based catalogues is significantly 
smaller than that used by Abell, reducing the enhancement 
of cluster richness by projection effects. The clustering signal 
found in these more recent cluster redshift surveys does not 
display large enhancements along the line of sight; further- 
more, the trend of increasing correlation amplitude with de- 
creasing space density of clusters is weaker than that found 
for Abell clusters (Croft et al. 1997) and is similar to the 
trend expected in current models of structure formation. A 
similarly weak dependence of correlation length on cluster 
space density is found in X-ray selected cluster catalogues, 
such as the X-ray Bright Abell Cluster Sample (XBACS) 
and the REFLEX sample (Collins et al. 2000), which are 
less susceptible to line of sight projection effects than the 
optically selected Abell catalogue (Abadi, Lambas & Muriel 
1998; Borgani et al. 1999). However, Miller et al. (1999) ar- 
gue that the early redshift surveys of Abell clusters contain 
large fractions of low richness clusters (Abell richness class 
R — 0), which were not intended to form complete samples 
suitable for statistical analyses. Miller et al. (1999) present 
the clustering analysis of a new redshift survey of Abell clus- 
ters with richness R > 1, and with the majority of cluster po- 
sitions determined using several galaxy redshifts. The clus- 
tering signal along the line of sight is greatly reduced in the 
new redshift surveys compared with the Bahcall & Soneira 
(1983) results, and is comparable to the amount of distor- 
tion of the clustering pattern found for APM clusters (see 
Fig 5 of Miller et al. 1999). The anisotropy is further reduced 
after the orientation of two superclusters that are elongated 
along the line of sight is changed. Peacock & West (1992) 
also found that restricting the attention to higher richness 



Abell clusters removed the strong radial anisotropy seen in 
the clustering measured in the earlier surveys. 

Another possible source of systematics could rely on 
the fact that several cluster distances are determined by a 
single galaxy redshift, usually the brightest cluster member. 
Therefore it is important to explore the effects of cluster 
distance errors on the analysis of correlation function of rich 
clusters. 

The gravitational amplification of small primordial fluc- 
tuations has been analysed through numerical simulations 
to explore the spatial distribution of clusters (e.g. White et 
al. 1987; Bahcall & Cen 1992; Croft & Efstathiou 1994b; 
Watanabe, Matsubara & Suto 1994; Eke et al. 1996). These 
early studies do not reach a consensus on the predicted clus- 
tering of clusters in cold dark matter cosmologies. Part of 
the reason for this discrepancy is due to differences in the 
way in which clusters are identified in the simulations (Eke 
et al. 1996). More recent studies have made use of much 
larger volumes than in these earlier studies, with sufficient 
resolution to allow the reliable extraction of massive dark 
matter haloes that can be identified as rich clusters (Gover- 
nato et al. 1999; Colberg et al. 2000, Jenkins et al. 2001). 

In this paper, we analyse the redshift space clustering 
of massive dark matter haloes in mock catalogues extracted 
from the rCDM Hubble Volume simulation. In particular, 
we use the mock catalogues made available by the Virgo 
Consortium, which contain galaxy angular positions and dis- 
tances, with a redshift distribution set by a selection func- 
tion. In this work we will make use of these mock catalogues 
to analyse in detail the effects of projection biases on cata- 
logues of clusters identified in two dimensions. We also pre- 
dict the results of the new generations of cluster catalogues 
with a high degree of spectroscopic completeness such as the 
2dF Galaxy Redshift Survey (Norberg et al. 2002). 

The outline of this paper is as follows: section 2 de- 
scribes the equations which allow us to obtain a statis- 
tical description of the redshift-space correlation function 
as a function of coordinates parallel and perpendicular to 
the line of sight, through the pairwise velocity dispersions, 
redshift-space correlation length and bias parameters; sec- 
tion 3 presents the correlation function measurement tech- 
nique. Section 4 presents the different mock cluster samples 
constructed using several cluster identification algorithms, 
and studies the results arising from using different number 
of redshifts to derive cluster distances and also different val- 
ues of the search radius in cluster identification. In this sec- 
tion, we also study the dependence of the relative pairwise 
velocities, redshift-space correlation length and bias factors 
as a function of the number of redshifts used in the determi- 
nation of the cluster distances. In section 5 we discuss our 
results and present the main conclusions drawn from this 
work. 



2 REDSHIFT-SPACE CORRELATION 
FUNCTION ANISOTROPY 

The redshift-space correlation function can be calculated as 
a function of the pair separation parallel and perpendicular 
to the line of sight. This approach has led to the quantifi- 
cation of characteristics of the redshift-space distribution of 
galaxy and clusters of galaxies, such as the "fingers of God" , 
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which are elongated structures seen in redshift surveys, orig- 
inated from the random motions of galaxies inside clusters. 

The strength of the "fingers of god" effect depends on 
the pairwise velocities of galaxies, which so far has been 
measured using two different methods. The first method 
corresponds to that adopted by Loveday et al. (f996) and 
Ratcliffe et al. (1998), which compares a theoretical expres- 
sion for the correlation function with the measured values, 
(, m (a,7v) (the index m indicates measured quantities), in a 
grid of values of distances parallel and perpendicular to the 
line of sight. The other approach presented by Padilla et 
al. (2001), compared the isopleths (curves of equal corre- 
lation function amplitude) in the measured and predicted 
correlation functions. The £(<r, tt) contour levels are approx- 
imated by the functions r m (8) and r p (9) for the measured 
and predicted correlations respectively. Here 9 is the polar 
angle measured from the direction perpendicular to the line 
of sight (Padilla et al. 2001), such that £ x (a r , tiy) -£ fix = 0, 
where ay = r x fix (6) cos(0), 7iy = r X f ix (0) sin(#), and the in- 
dex x indicates either measured (m) or predicted quantities 

(P)- 

In cither method, the measured correlation function 
£ m (cr, 7r) is compared with the convolution of the real-space 
correlation function, £(r) with the pairwise velocity distri- 
bution function, /(to), following Bean et al. (1983). We cal- 
culate 



/oo 
[1 + CM1/K -w a (r,r')]dw', 
-OO 



(1) 



where r 2 = r' 2 + a 2 , and Ho is the Hubble constant, r' = 
tt — w' /Ho (the prime denotes the line-of-sight component 



of a vector quantity) and 
W 3 (r,r')^-tfo/3£(r)(l + e(r)rV 



(2) 



is the mean streaming velocity of galaxies at separation r. 

Following usual procedures we calculate the best-fit rms 
peculiar velocity, < w 2 > 1,>2 , for an exponential distribution, 



exp 



-y/2- 



< W 2 >V2 



(3) 



We adopted this pairwise velocity distribution as it has 
shown to be the most accurate fit to the results from nu- 
merical simulations (Ratcliffe et al. 1998). 

Two estimates of the real-space correlation function are 
usually adopted in equation 1: the inversion of the projected 
correlation function (Baugh 1996, Ratcliffe et al. 1998), or 
a simple power-law fit obtained from the angular correla- 
tion function (Padilla et al. 2001). In this work we use the 
theoretical correlation function of the simulated mass dis- 
tribution and adopt a linear bias parameter b that relates 
clusters and mass by Fourier transforming the model power 
spectrum 



t-CDM , \ 1 ,2 



kr 



(4) 



We use this to evaluate the theoretical prediction for £ p (a, n) 
using equation (1). We search for the optimum values of 
the scale independent bias parameter 6 and < w 2 >^ 2 by 



minimising the quantity x > 
X 2 = ^[rr(^)-rf(^)] 2 , 



(5) 



where we have chosen to compare a set of discrete levels, 
/ = 0.6,0.8,1.0,1.2 and 1.4, of the redshift-space correla- 
tion function £(<r, tt) amplitude instead of comparing values 
of the correlation function on a grid of a and tt distances. 
Our choice is based on the fact that more reliable and sta- 
ble results are obtained using this technique (Padilla et al. 
2001). 



3 MEASURING £(<7,tt) 

We have computed £ m (o", tt) as a function of the separa- 
tion perpendicular (a) and parallel (it) to the line of sight. 
To compute £ m (cr, 7r), we generate a random catalogue with 
the same angular limits and radial selection function as the 
sample of objects. We cross correlate data-data and random- 
random pairs (Ndd and N rr respectively) binning them as 
a function of separation in the two variables a and tt. Our 
estimate of £°(<t, tt) is (Davis & Peebles 1983): 



N dd n R 



(6) 



where no and tir are the number of data and random points 
respectively. 

This estimator is affected by uncertainties in the mean 
density, in particular on large scales where £(r) is small (eg. 
Hamilton 1993). However, since our analysis is confined to 
small separations where the correlation function has a large 
amplitude, our results are insensitive to the choice of esti- 
mator. 

Figure 1 shows the contours of equal amplitude of the 
2-point correlation function of groups identified in a numer- 
ical simulation (left panel) and simulation particles (right 
panel) in the coordinates a and tt (mock galaxy and group 
samples from Padilla et al. 2001). The compression of the 
iso-correlation curves for groups in the tt direction can be 
appreciated, indicating a lack of high pairwise velocities in 
this sample. From the comparison with results derived from 
clusters identified in two-dimensional surveys (see for in- 
stance Bahcall, Soneira & Burgett, 1986) the existence of 
either large pairwise velocities or large projection biases is 
clear, as also discussed by Sutherland (1988). 

This elongation could originate in the systematic pres- 
ence of groups along the line of sight, in the fields of clusters 
identified from angular data. From the theoretical point of 
view, such strong elongations along the line of sight are not 
expected in a hierarchical scenario of structure formation. 
This is confirmed by the compression observed in left hand 
side panel of figure 1 and provides a clear evidence for the 
infall of groups onto larger structures (see also Peacock et al. 
2001, for galaxies). As the contours are well defined for all 
the correlation function levels shown in this figure (£ = 0.6, 
0.8, 1.0, 1.2 and 1.6), the analysis developed in the next sec- 
tions will be based on the results at these contours. In the 
right hand side panel, the large distortion seen in the iso- 
correlation curves in the tt direction is due to the larger pe- 
culiar velocities of the simulation particles and their smaller 
correlation amplitude, associated with the non-linear evolu- 
tion of the density field on small scales. 
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Figure 1. The 2-point correlation function of simulation groups (left hand side panel) and simulation particles (right panel) in the 
coordinates a and 7r (mock group and galaxy samples from Padilla et al., 2001). From light to dark, the transitions between different 
shadings correspond to fixed level values of £ = 0.6,0.8, 1.0, 1.2, and 1.4, which are later used to infer the relative pairwise velocities, 
w\2 ■ The thick line corresponds to the £ = 1 level, and the thin lines show the expected contours in the absence of peculiar velocities. 



4 RESULTS FROM MOCK CATALOGUES 
EXTRACTED FROM N-BODY 
SIMULATIONS 

In this section, we analyse the statistical properties of clus- 
ter samples drawn from mock catalogues. These mock cat- 
alogues have been obtained from one of the mock galaxy 
catalogues extracted from the rCDM Hubble Volume sim- 
ulation by the Durham Extragalactic Group (Evrard et al., 
2002) following a procedure similar to that used in Cole 
et al. (1998). This simulation follows the evolution of cold 
dark matter (CDM) density fluctuations in a rCDM cos- 
mology, with parameters flo — 1.0, a cosmological constant 
A c 2 /(3-ffo ) = 0, and a power spectrum described by a shape 
parameter of F — 0.21 and as = 0.6. The huge volume of 
the simulation (8/i _3 Gpc 3 ) and the large number of par- 
ticles employed (10 9 ) allow cluster statistics to be studied 
with unprecedented accuracy (Colberg et al. 2000; Jenkins 
et al. 2001). 

The particular mock catalogue we chose to apply the 
different identification methods to, is that of the APM sur- 
vey, with a selection function consistent with a limiting mag- 
nitude bj = 20.0. 

This particular mock catalogue has the advantage that 
the derived galaxy apparent magnitudes have k + e correc- 
tions which reproduce the results of the 2dFGRS (see Nor- 
berg et al. 2002, for details). Also we notice that the observer 
is not chosen at random but instead constrained to lie in a 
region with similar properties to those of the Local Group. 



4.1 Mock Groups identification 

We use three main algorithms to search for groups in the 
numerical simulations, and construct four samples of clusters 
from them: 



• Sample 1, is constructed using a friends-of- friends algo- 
rithm applied to the 3-dimensional distribution of particles 
in the simulation cube. We used a linking length 6 = 0.2, 
expressed in terms of the mean interparticle separation, ad- 
equate for the rCDM cosmology (Helly et al., 2003). This 
sample is almost completely free of spurious clusters since 
we have chosen to study only those groups with at least 
60 member particles. The fraction of unbound associations 
found using FOF drops quickly as the number of member 
particles rises over 10. 

Given the lack of a significant number of spurious clusters, 
the measurements of anisotropies in the correlation function 
should clearly show the expected infall pattern. As we do not 
have computational access to the full Hubble Volume sim- 
ulations, wc applied the FOF identification algorithm to a 
numerical simulation with box size 600h _1 Mpc with exactly 
the same cosmological parameters as the rCDM simulation. 
This ensures that this particular sample of clusters is statis- 
tically comparable to those obtained from the mock galaxy 
catalogue produced by the Durham group. 

• Sample 2 comprises the results from the identification 
of groups in a mock catalogue that includes a selection 
function, and where the position of particles is determined 
from redshifts, that is including particle peculiar velocities. 
We use a friends-of-friends algorithm with different link- 
ing lengths in the directions parallel and perpendicular to 
the line of sight. This procedure emulates the identifica- 
tion process used in the construction of the group sam- 
ple obtained from the Updated Zwicky Catalogue, which 
is based on an algorithm described by Huchra & Geller 
(1982). In this case we use different linking lengths in the 
directions perpendicular and parallel to the line of sight, 
which vary linearly with distance, do = V/V/id0.229h _1 Mpc 
and vq = V/V/id350km/s, where V is the distance, and 
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Figure 2. Galaxy density and clusters in the mock catalogues. The density of galaxies in the pixelmap is proportional to the darkness 
of the pixels in this figure. The double-line circles show clusters from mock sample 3, whereas thick-line circles indicate the positions of 
clusters in mock sample 2. This figure shows a sub-set of the pixclised mock galaxy catalogue and is shown in an equal area projection 
centred at a = l h and <5 = —45°, with radius r eqa = 160°. The number of pixels per side is n v i x = 2900, and was set in order to obtain 
a mean number of galaxies n = 1 per pixel. The pixelmap sub-set size is 170x90 pixels. 



Vfid — 7000km/s. These values correspond to a density con- 
trast 8p/p ~ 80. 

• Sample 3 is obtained using the algorithm presented by 
Lumsden et al.(1992), which is used in identifying clusters 
from the COSMOS catalogue. Following the prescription de- 
scribed by Lumsden et al. (1992), we use an angular search 
radius r c = 1.0h _1 Mpc, and apply this procedure to the 
same mock catalogues as the second algorithm. We tested 
this identification algorithm by applying it to the COSMOS 
galaxies, and comparing the outcome of the identification al- 
gorithm to the positions of the EDCC clusters (Lumsden et 
al. 1992). We found out that ~ 75% of the EDCC clusters 
were re-obtained by our identification procedure. We also 
applied this to a numerical simulation, and found that only 
~ 70% of the clusters identified in the simulation cube with 
a FOF algorithm were found by our angular identifier. More 
importantly, only ~ 60% of the clusters identified are real 
clusters. These numbers illustrate the degree of projection 
effects sample 3 is subject to, and its study will serve as a 
test of how these effects translate into the cluster correlation 
function in redshift-space. 

• Sample 4: We extract from sample 3 the subset of clus- 
ters whose angular positions are coincident (within the clus- 
ter identification radius) to those in sample 2. This sample 
is derived from a 2 dimensional catalogue and is confirmed 
to have a redshift space identification at the same angu- 
lar position. Cluster distances are determined by using the 
galaxy members identified from the angular position data. 
This sample should be less affected by projection effects than 
that obtained from the angular data alone. However, clus- 
ter distances may include particles that are not physically 



related to any particular cluster at all. We find that only 
~ 50% of the clusters from sample 3 are present in this 
sample indicating that this procedure provides a significant 
restriction. We also notice that this fraction is slightly differ- 
ent than that resulting from comparing the sample 3 clusters 
with those obtained from using the FOF algorithm on a full 
simulation box. 

All the samples defined in this section include groups with 
a minimum mass threshold. This threshold is set so as to 
make the space density of all the samples as close to that of 
sample 1 as possible. 

In figure 2. we show the different results from the iden- 
tification of clusters of galaxies in the mock catalogue using 
angular and 3-D galaxy positions. This figure shows the den- 
sity of galaxies in the mock catalogue in pixels, and clusters 
of galaxies in circles. The gray scale of the pixels indicate 
number of galaxies; the darker the pixel the higher is the 
number of galaxies in it. As can be seen, the circles enclose 
regions of high density of galaxies. The different line styles 
with which the circles are drawn correspond to the different 
algorithms used in the identification of clusters. The double- 
line circles correspond to clusters from sample 3, that is, 
clusters identified using angular data. In this case, the ra- 
dius of the circle corresponds to the radius out to which the 
counting of galaxies was done when measuring the cluster 
richness. The thick line circles show clusters from sample 2, 
identified from 3-dimensional positions affected by the pe- 
culiar velocities of the clusters. In this case, the radius of 
the circle is proportional to the richness of the clusters. By 
inspection to this figure, we can appreciate the large fraction 
of clusters from sample 3 whose angular positions are not co- 
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Figure 3. £(<r, 7r) from simulated catalogues of clusters of galaxies with distances measured using 10 member galaxies, obtained using 
different algorithms. Upper left panel shows the results from sample 1, upper right panel from sample 2, lower left panel shows the 
contours obtained from sample 3, and the lower right panel, those from sample 4. Shadings and line conventions are as in figure 1. 



incident with any 3-dimensional cluster within one angular 
search radius. 

The aim of constructing the mock cluster samples pre- 
sented here is to understand and reproduce the results ob- 
tained from observational cluster samples. In particular, we 
intend to establish the reason for the large elongations along 
the line of sight present in the measured correlation func- 
tions. The different behaviours of the correlation functions 
found for the four mock samples are shown in figure 3, where 
we used clusters of galaxies with distances measured using 
10 member galaxies. The results from samples 1 and 2 show 
the expected infall pattern. This simply indicates that the 
FOF algorithm still works well when applied to a catalogue 
which incorporates a selection function and includes the ef- 
fects of galaxy peculiar velocities. The infall is not so ev- 
ident when inspecting the remaining panels, which simply 
show the different degrees of projection effects the different 
samples are subject to. It can be seen that samples 3 and 



4 show somewhat large elongations along the line of sight. 
Taking into account that all panels show the results for the 
same value of n z = 10, this is a clear signature of projection 
effects. The fact that sample 4 shows elongated contours 
along the line of sight is indicative of substantial projection 
contamination. Given that we have constructed this sample 
by requiring an angular coincidence between 2-dim and 3- 
dim identified clusters, such anisotropies reflect the fact that 
there is a strong contribution by other structures along the 
line of sight in producing the observed distortion pattern. 



In the following section we provide detailed descriptions 
of the characteristics of each sample, analysing the differ- 
ences in elongations arising from using different n z . 
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4.2 Effects of observed number of redshifts 

In order to make a thorough comparison between the mock 
cluster samples and the observational correlation functions, 
we study in detail the effects of using different n z in each of 
our mock samples. 

We first show the results from sample 1 which com- 
prises clusters of galaxies identified from the 3-dimensional 
distribution of particles in the full numerical simulation. Fig- 
ure 4 shows the correlation function contours when the dis- 
tances to the clusters in this sample have been calculated 
using I (left) and 20 (right panel) member redshifts. As can 
be seen, the differences between the panels are quite small. 
The left panel only shows slight evidence of an elongation 
along the line of sight. This is an effect produced by the er- 
rors in the distance to these clusters arising from the use of 
only one redshift to determine their distances. By compar- 
ison between the left upper panel in figure 3 and the right 
panel in figure 4 (that is, Sample I with n z — 10 and 20), it 
can be seen that the use of a small n z = 10 is sufficient to 
erase spurious elongations along the line of sight, and that 
considering more redshifts per cluster makes no significant 
difference. This result once more reflects the reliability of the 
FOF algorithm in finding clusters of galaxies in numerical 
simulations. 

We now investigate the effects of changing n z in sample 
2, which consists of clusters identified from a mock galaxy 
catalogue which incorporates a selection function. We show 
in figure 5 the correlation function contours for samples with 
distances obtained using I and 20 redshifts in the left and 
right hand side panels respectively. As can be seen in this 
figure, the effects of using a small n z are not critical. In 
the case of using I galaxy, the infall pattern can still be 
seen, specially in the correlation contours corresponding to 
£(cr, 7r) < I. The inclusion of further redshifts only makes a 
small change, although it can be seen that in this case, this 
signature is also visible for levels of higher correlation func- 
tion. When comparing with figure 4 one can notice a neat 
infall pattern in the clusters from sample I for high levels 
of correlation even when n z = 20 (a very clear signature for 
£ = 1 contour levels, and mildly visible for £ = 1.4). The case 
of sample 2 is less ideal, since this infall pattern can only be 
marginally seen for a correlation function contour level cor- 
responding to £ = 1, which simply reflects the higher degree 
of difficulty in identifying clusters from a galaxy catalogue 
which incorporates a selection function. 

It should be noted that sample 2 is representative of 
cluster or group catalogues identified in galaxy redshift sur- 
veys, as is the case with UZC groups. Therefore, the infall 
of structures which is a signature of the hierarchical cluster- 
ing obtained in the correlation function contours measured 
using the UZC groups sample (Padilla et al. 2001), can be 
assessed by the results of sample 2. In other words, the study 
arising from sample 2 results in a correlation function which 
is not badly affected by projection effects, even for small 
n z , which also indicates that the use of a small n z does not 
induce important elongations along the line of sight. 

We next consider a mock cluster catalogue drawn from 
an angular galaxy catalogue (sample 3). Here the distances 
to each cluster were obtained using the redshifts of galaxies 
identified as cluster members using angular data. The cor- 
relation function of these catalogues are shown in figure 6, 



where we show the results when using 1, 5, 10, and 20 galaxy 
redshifts in determining cluster distances in the upper left, 
upper right, lower left and lower right panels respectively. 

The results in this figure show significant differences in 
the elongations for the different samples. In the case where 
only 1 galaxy redshift is used to assess the cluster distance, 
the elongation along the line of sight is severe, and not en- 
tirely different to what was found for Abell clusters with 
distances estimated using less than 10 galaxy redshifts (see 
for instance Bahcall, Soneira & Burgett, 1986). The use of 5 
galaxies already makes a marked difference but would still 
not be sufficient for an infall pattern to be found. For com- 
parison, the number of Abell clusters with n z > 5 is small, 
about 20 % of the total sample. The first, faint traces of an 
infall can be seen when considering 10 galaxy redshift mea- 
surements, and only for small values of £(cr, it) < 1, which 
are difficult to obtain from an observational sample without 
large contributions from noise. 

The main sources of elongation cannot be considered to 
be the error in the cluster position arising from using a small 
n z . This is so since, for instance, we have already shown in 
figure 5 that the use of a very small number of galaxies 
is still not enough to erase the infall pattern. It is more 
likely that the elongation is produced by the inclusion of non 
member galaxies in the determination of cluster distances, 
the worst scenario being that in which only a fraction of the 
identified clusters correspond to physically bound galaxies, 
and the remainder being constructions of projection effects 
that cause the spreading of galaxies along the line of sight 
to appear as coherent structures, as only angular data is 
available. 

We now consider what are the expected effects of the 
number of redshift measurements in samples of clusters iden- 
tified in two dimensions with angular positions coincident 
with 3-dimensional identification. 

Figure 7 shows the correlation function contours for 
clusters in mock sample 4. The different panels correspond 
to different n z , in the same order as the previous figure. 

As can be seen in figure 7, the projection effects in sam- 
ple 3 are still present, even after the clusters have coincident 
angular positions with a cluster sample identified in 3D. The 
elongations in £ (a, tt) are quite severe, and reflect the con- 
tamination of foreground and background structures, which 
introduces biases in the cluster distances. 

4.3 Effects of projected radius in the 
identification algorithm 

The algorithms used to identify clusters from angular data 
depend quite sensitively on the search radius r c used to find 
member candidates. In the case of Abell clusters this corre- 
sponds to r c — 1.5h _1 Mpc. 

A possible cause of projection effects becomes clear 
when one considers that the density of galaxies diminishes as 
a function of cluster radius. This means that in the outskirts 
of the cluster, the angular population of galaxies will be com- 
posed of ever larger fractions of foreground and background 
galaxies. Therefore it is increasingly probable to consider as 
member galaxies, objects that are not physically bound to 
the cluster. We investigate this effect by using different clus- 
ter search radius in our 2-dimensional cluster finding algo- 
rithms. In figure 8, we show the correlation function contours 
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Figure 4. The correlation function contours for clusters identified using the FOF algorithm from the simulation (mock sample 1). The 
different panels correspond to different n z : 1 redshift measurement (left) and 20 (right) member rcdshifts. Shadings and line conventions 
are as in figure 1. 
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Figure 5. Correlation function contours for the cluster mock sample 2 which comprises clusters identified using a modified version of 
the FOF algorithm, applicable to a mock redshift survey, which includes peculiar velocities and a selection function. We show the results 
for cluster distances obtained using 1 and 20 redshifts in the left and right panels respectively. Shadings and line conventions are as in 
figure 1. 



for clusters identified using r c — 0.5, 1.0 and 1.5h _1 Mpc in 
the left, middle and right panels respectively. In all these 
cases, n z — 10, which ensures us that any elongations ob- 
served in the £(cr, ir) isopleths will be mainly produced by 
projection effects arising from the choice of radius. 

As can be seen, the use of a small search radius is not en- 
tirely convenient as it produces spurious clusters, probably 
from confusing satellite haloes with proper clusters. This is 
in agreement with the elongated iso-correlation contours in 



the left panel, and also with the small amplitude of the cor- 
relation length (To, that is £(co, 7T = 0) = 1, and is supported 
by figure 9, which shows the clusters of galaxies identified 
using r c = 1.0h _1 Mpc (circles), and those identified using 
r c = 0.5h _1 Mpc (dots); the gray scale of the pixels indicate 
number of galaxies, and the radii of the circles correspond 
to the radius out to which the counting of galaxies was done 
when finding the cluster richness. As can be seen in three 
cases in figure 9, the identification using a small search ra- 
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Figure 6. Correlation function isoplcths obtained from clusters identified from angular positions drawn from the numerical simulation 
(mock cluster sample 3), with 1, 5, 10, and 20 individual redshift measurements in panels upper left, upper right, lower left, and lower 
right respectively. Shadings and line conventions are as in figure 1. 



dius produces pairs of clusters whereas the use of a larger 
search radius produces a single cluster. The satellite cluster 
identified with a small search radius corresponds to a lower 
peak in the density of projected galaxies in all cases; also, 
the richness of a satellite is smaller than the main cluster 
in general, confirming the hypothesis of low mass satellites 
stated earlier in this paragraph. 



The other values considered, r c — 1.0 and 1.5h~ Mpc, 
produce similar results, both largely free from projection 
effects, and with a larger correlation length a (figure 8). 
There is a slight difference in the noise level in these sam- 
ples, the correlation levels in the r c = 1.5h _1 Mpc sample 
being less smooth than the smaller r c case. This is not a 
severe difference, but is enough to support our choice of 
r c — 1.0h _1 Mpc used throughout this paper. 



4.4 Relative velocities and correlation lengths in 
mock cluster samples 

In this section we present determinations of relative veloci- 
ties, redshift-space correlation lengths and bias factor from 
the £(cr, 7r) anisotropies. Specifically, we study the depen- 
dence of the relative pairwise peculiar velocities obtained 
using eq. 5 for the different mock samples as a function of n z . 
We recall that the cosmological parameters of the simulation 
from which the mock catalogues were extracted (fl m = 1, 
SIa = 0.0, r = 0.21 and erg = 0.6) are those used in the 
calculations. 

Figure 10 shows the pairwise velocities found in the dif- 
ferent samples (sample 1 in upper left panel, sample 2 in 
the upper right, sample 3 in the lower left, and sample 4 in 
the lower right panel) . The different thin lines correspond to 
results using different values of £(cr, n). The thick solid line 
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Figure 7. Correlation function results from clusters identified using mock angular data using 2D, and whose angular positions coincide 
with those of real clusters in the simulation (mock sample 4). Cluster distances were obtained using 1, 5, 10 and 20 rcdshifts (panels 
upper left, upper right, lower left, and lower right respectively). Shadings and line conventions are as in figure 1. 
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Figure 8. Correlation function contours for mock clusters identified from angular data, using r c = 0.5, 1.0 and 1.5h — 1 Mpc in the left, 
middle and right hand side panels respectively. Shadings and line conventions are as in previous figures. 
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Figure 10. Pairwise velocities obtained for the different samples as indicated in the legend of each panel. The different thin lines 
correspond to different values of £(<r, n). The thick solid lines show the mean pairwise velocity dispersion for the full sample of clusters 
in the 3-dimensional simulation box. 



shows the mean pairwise velocity dispersion from the clus- 
ters identified in three dimensions in the simulation box. 

This plot shows the great range of results which can be 
obtained from the contours in the correlation function if the 
mean n z used for the clusters in the sample is not known. It 
is remarkable the fact that the relative velocities found using 
clusters from sample 2 which were obtained in redshift space 
are in good agreement with the actual value measured from 
the full simulation. 

The disagreement between the value of true relative 
cluster velocities with the measurements from the redshift- 
space correlations is most severe when the sample has been 
identified using angular data. It is also interesting that the 
velocities obtained from sample 4 (clusters identified from 2 
dimensions with angular confirmation from inspection over 
the 3-dimensionally identified sample) are in some cases 
still very high even when their distances are obtained us- 
ing a large n z . It should be noted though, that the velocities 



favoured by low noise contour levels, £ > 1, suggest small 
cluster relative velocities. The problem of a wide range of 
predictions for W12 is not present in the case of clusters in 
sample 3, in part due to the larger number (almost a factor 
of 2) of clusters in this sample. The results obtained from dif- 
ferent correlation levels are more compatible, showing values 
differing by less than 1000 km/s among the different correla- 
tion levels. The observed behaviour of the relative velocities 
does not favour sample 4 over sample 3, indicating that the 
confirmation by angular coincidence with real clusters can 
not remedy the systematic problems of samples of clusters 
identified in two dimensions. 

This problem can be dug into a little further by study- 
ing the cluster redshift-space correlation lengths, which also 
show great variation with n z . We show these quantities in 
figure 11, where marked differences between samples of clus- 
ters identified from angular and 3-D data can be seen. Both 
samples obtained using 3-D information show a steady value 
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Figure 9. Clusters in mock sample 3 identified with different 
cluster search radius. The density of galaxies is shown as a grey- 
scale pixclmap. The circles show clusters identified using r c = 
l.Oh -1 Mpc, and the dots, those identified using r c = 0.5h -1 
Mpc. This figure shows a sub-set of the pixelised mock galaxy 
catalogue and is shown in an equal area projection centred at 
a = l h and <5 = —45°, with radius r eqa = 160°. The number of 
pixels per side is n p i x = 2900, and was set in order to obtain a 
mean number of galaxies n = 1 per pixel for a magnitude limit 
vrig X < 20.0. The pixelmap sub-set size is 110x200 pixels. 



of so, without any significant variation with n z . The first 
important result from this figure is that the results from 
clusters in sample 2 are in excellent agreement with the true 
values. We notice that the results for sample 1 are indis- 
tinguishable from those of sample 2 and are not shown for 
clarity. 

An interesting feature in figure 11, is the difference be- 
tween confirmed and unconfirmed 2-D clusters, which also 
show controversial results in figure 10. The observed corre- 
lation length of clusters in sample 4 is higher than that of 
redshift space identified clusters in sample 2. Furthermore, 
this correlation length is also higher than that of the clusters 
in sample 3, which is expected to be more contaminated by 
foreground/background galaxies. This differences could be 
explained if several spurious clusters in sample 3, were as- 
sociations of a small number of galaxies along the line of 
sight. The higher correlation amplitude derived for sample 



40 



| 30 



\ 
o 
W 



20 



~i — i — i — i — i — i — r 



i — i — i — i — i — i — i — i — i — i — i — i — r 



# Sample 2 
o Sample 3 
□ Sample 4 



o°0°OOoOO-f 



J I I I I I I I I I I I I I I I I I I I I I L 



10 

5 10 15 20 25 

Figure 11. Values of the redshift-space correlation length so as 
a function of n z . The symbols in these panels correspond to the 
different samples and are indicated in the figure. The dotted lines 
show the range of acceptable values as measured from the full 
simulation. 



4 clusters with respect to those in sample 2, could indicate 
that the identification of clusters using angular data is bi- 
ased towards massive systems. This follows from the larger 
values of so found from the sample of confirmed clusters. 
The inclusion of spurious small associations of galaxies in 
sample 3, would make the value of so in this case smaller 
than those obtained for clusters in sample 2. We have also 
confirmed this by inspection to the mean cluster mass of 
samples 2 and 4. 

These findings are in agreement with the results of 
Miller et al. (1999), who showed that by restricting a sample 
of Abell clusters to high richnesses the degree of projection 
effects is significantly reduced. The difference in correlation 
lengths also explains the marginally larger cluster relative 
velocities found in sample 4 compared with sample 3, at 
least when considering high levels of correlation, where the 
noise is not playing an important role in the results from 
sample 4. Equation 2 shows that the streaming velocities of 
clusters are proportional to their correlation functions. This 
means that the clusters in sample 4 have larger streaming 
velocities than those in sample 3. Taking into account that 
the elongations along the line of sight seen in figure 7 arc 
similar for both samples, it is clear that the results from 
equation 5 will yield larger values of cluster relative veloci- 
ties for the sample 4 than for sample 3. 

The results for the values of the bias parameter obtained 
from minimising equation 5 as a function of n z show a be- 
haviour similar to the results from the redshift-space corre- 
lation length. Figure 12 shows the resulting bias parameter 
for the different mock samples (the symbols are indicated 
in the figure). The dotted lines show the acceptable range 
of effective bias parameters from the simulation, obtained 
using the Sheth, Mo & Tormen (2001) mass function, and 
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Figure 12. Values of the bias parameter obtained from minimis- 
ing equation 5 as a function of n z . The symbols in these panels 
correspond to the different samples and are indicated in the figure. 
The dotted lines show the acceptable range of bias parameters for 
the number density of clusters in the mock samples obtained using 
the Sheth, Mo & Tormen (2001) mass function. 



the number density of clusters in the mock samples along 
with its uncertainty. 

As it can be seen, the bias found for the clusters in 
mock sample 2 is within the acceptable values for the CDM 
effective bias. This result holds for any value of n z . As in 
the previous figure, the results for sample 1 are consistent 
with the true values, and are not shown in order to pre- 
serve clarity. The result from sample 3 is in agreement for 
roughly n z > 10. Sample 4 shows significantly larger values 
of bias and correlation length, suggesting that this sample 
comprises a high mass cluster population in agreement with 
the results found earlier in this section. 



5 CONCLUSIONS 

We have analysed the consequences of different effects on the 
distortion of the correlation function of clusters in redshift 
space for different mock samples of galaxy clusters. We take 
into account the pairwise velocity distribution of galaxy sys- 
tems, coherent bulk motions, as well as redshift errors, and 
different cluster identification systematics. The mock cluster 
samples were extracted from numerical simulations, follow- 
ing algorithms that closely match the procedures used to 
identify the clusters in observational samples. We find that 
the correlation function of galaxy systems is influenced by 
wrongly assigned cluster distances due to a small number of 
galaxy redshift measurements in the fields of clusters. How- 
ever, the most important effect is that associated with the 
cluster identification procedure from two dimensional sur- 
veys. Due to these effects, the estimated mean pairwise ve- 
locity dispersion can be an order of magnitude larger than 
the actual value. 

This is consistent with the observed anisotropy of the 
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Figure 13. Correlation function results from mock 2dF100k re- 
lease galaxy groups constructed by Merchan & Zandivarez (2002). 
Shadings and line conventions are as in figure 1. 



correlation function contours of UZC groups, which shows 
the flattening produced by this infall motion (Padilla et al. 
2001). For samples of clusters identified in mock galaxy red- 
shift surveys we find an infalling pattern even in the case 
where the distances to groups or clusters of galaxies are ob- 
tained using a small number of member redshifts. 

In order to provide predictions for new generation clus- 
ter catalogues we have also analysed a mock sample of 
groups resembling those obtained by Merchan & Zandivarez 
(2002) from the 2dF100K release galaxy redshift survey. 
This mock catalogue is subject to the same complicated an- 
gular mask as the observations and serves as a further test 
of the reliability of the information that can be obtained 
from observational cluster and group catalogues. The re- 
sults shown in figure 13 are similar to those of Sample 2, 
and indicate that the angular mask does not affect severely 
the results so that consistent values of 1012 and so can be 
derived. 

In order to explore the projection effects afflicting clus- 
ter samples identified in two dimensions we analysed dif- 
ferent mock cluster samples identified from mock angular 
catalogues using different cluster search radius. Our find- 
ings favour the use of smaller radius than that used in the 
identification of Abell clusters. The use of a too small value 
produces a sample increasingly affected by projection effects 
and the inclusion of smaller groups of galaxies which are 
probably cluster satellite haloes. 

We notice that 2-dim identified clusters whose angular 
positions coincide with a true 3-dim cluster show a strongly 
distorted clustering pattern. This fact shows the large influ- 
ence of groups and clusters along the line of sight in the ob- 
served elongation. Contamination is therefore mainly arising 
from structures present in the filamentary large scale distri- 
bution. These results indicate the difficulties of obtaining 
unbiased cluster samples even when using accurate photo- 
metric data such as present in new and forthcoming surveys. 
Redshift information is crucial for constructing cluster sam- 
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pies which lack significant projection effects, since galaxy 
colours alone can remove foreground and background con- 
tamination but not distinguish membership to clusters at 
relatively smaller separations along the line of sight. 
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